2,349 research outputs found

    BlogForever D2.6: Data Extraction Methodology

    Get PDF
    This report outlines an inquiry into the area of web data extraction, conducted within the context of blog preservation. The report reviews theoretical advances and practical developments for implementing data extraction. The inquiry is extended through an experiment that demonstrates the effectiveness and feasibility of implementing some of the suggested approaches. More specifically, the report discusses an approach based on unsupervised machine learning that employs the RSS feeds and HTML representations of blogs. It outlines the possibilities of extracting semantics available in blogs and demonstrates the benefits of exploiting available standards such as microformats and microdata. The report proceeds to propose a methodology for extracting and processing blog data to further inform the design and development of the BlogForever platform

    Calculation of multiple-trait sire reliability for traits included in a dairy cattle fertility index

    Get PDF
    The advent of genetic evaluations for fertility traits in the UK offers valuable information to farmers that can be used to control fertility problems and safeguard against involuntary culling. In addition to estimated genetic merit, proof reliabilities are required to make correct use of this genetic information. Exact reliabilities, based on the inverse of the coefficient matrix, cannot be estimated for large data sets because of computational restrictions. A method to calculate approximate reliabilities was implemented based on a six-trait sire model. Traits considered were interval between first and second calving, interval between first calving and first service, non-return rate 56 days post first service, number of inseminations per conception, daily milk yield at test nearest day 110 and body condition score. Sire reliabilities were calculated in four steps. Firstly, the number of effective daughters was calculated for each bull, separately for each trait, based on total number of daughters and daughter distribution across herd-year-seasons. Secondly, multiple-trait reliabilities were calculated, based on bull daughter contribution, applying selection index theory on independent daughter groups. Thirdly, (great-) grand-daughter contribution was added to the reliability of each bull, using daughter-based reliability of sons and maternal grandsons. An adjustment was made to account for the probability of bull and son or grandson having daughters in the same herd-year-season. Without the adjustment, reliabilities were inflated by proportionately 0·15 to 0·25. Finally, parent (sire and maternal grandsire) contribution was added to the reliability of each bull. The procedure was first tested on a data subset of 28 061 cow records from 285 bulls. Approximate reliabilities were compared with exact estimates based on the inverse of the coefficient matrix. Mean absolute differences ranged from 0·014 to 0·020 for the six traits and correlation between exact and approximate estimates neared unity. In a full-scale application, sire reliability for the fertility traits increased by proportionately 0·47 to 0·79 over single-trait estimates and the number of bulls with a reliability of 0·60 or more increased by 42 to 115%

    The use of mid-infrared spectrometry to predict body energy status of Holstein cows

    Get PDF
    Energy balance, especially in early lactation, is known to be associated with subsequent health and fertility in dairy cows. However, its inclusion in routine management decisions or breeding programs is hindered by the lack of quick, easy, and inexpensive measures of energy balance. The objective of this study was to evaluate the potential of mid-infrared (MIR) analysis of milk, routinely available from all milk samples taken as part of large-scale milk recording and milk payment operations, to predict body energy status and related traits in lactating dairy cows. The body energy status traits investigated included energy balance and body energy content. The related traits of body condition score and energy intake were also considered. Measurements on these traits along with milk MIR spectral data were available on 17 different test days from 268 cows (418 lactations) and were used to develop the prediction equations using partial least squares regression. Predictions were externally validated on different independent subsets of the data and the results averaged. The average accuracy of predicting body energy status from MIR spectral data was as high as 75% when energy balance was measured across lactation. These predictions of body energy status were considerably more accurate than predictions obtained from the sometimes proposed fat-to-protein ratio in milk. It is not known whether the prediction generated from MIR data are a better reflection of the true (unknown) energy status than the actual energy status measures used in this study. However, results indicate that the approach described may be a viable method of predicting individual cow energy status for a large scale of application

    BlogForever D3.2: Interoperability Prospects

    Get PDF
    This report evaluates the interoperability prospects of the BlogForever platform. Therefore, existing interoperability models are reviewed, a Delphi study to identify crucial aspects for the interoperability of web archives and digital libraries is conducted, technical interoperability standards and protocols are reviewed regarding their relevance for BlogForever, a simple approach to consider interoperability in specific usage scenarios is proposed, and a tangible approach to develop a succession plan that would allow a reliable transfer of content from the current digital archive to other digital repositories is presented

    BlogForever D5.2: Implementation of Case Studies

    Get PDF
    This document presents the internal and external testing results for the BlogForever case studies. The evaluation of the BlogForever implementation process is tabulated under the most relevant themes and aspects obtained within the testing processes. The case studies provide relevant feedback for the sustainability of the platform in terms of potential users’ needs and relevant information on the possible long term impact
    corecore